Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
1.
J Proteome Res ; 22(7): 2199-2217, 2023 07 07.
Artículo en Inglés | MEDLINE | ID: mdl-37235544

RESUMEN

Generating top-down tandem mass spectra (MS/MS) from complex mixtures of proteoforms benefits from improvements in fractionation, separation, fragmentation, and mass analysis. The algorithms to match MS/MS to sequences have undergone a parallel evolution, with both spectral alignment and match-counting approaches producing high-quality proteoform-spectrum matches (PrSMs). This study assesses state-of-the-art algorithms for top-down identification (ProSight PD, TopPIC, MSPathFinderT, and pTop) in their yield of PrSMs while controlling false discovery rate. We evaluated deconvolution engines (ThermoFisher Xtract, Bruker AutoMSn, Matrix Science Mascot Distiller, TopFD, and FLASHDeconv) in both ThermoFisher Orbitrap-class and Bruker maXis Q-TOF data (PXD033208) to produce consistent precursor charges and mass determinations. Finally, we sought post-translational modifications (PTMs) in proteoforms from bovine milk (PXD031744) and human ovarian tissue. Contemporary identification workflows produce excellent PrSM yields, although approximately half of all identified proteoforms from these four pipelines were specific to only one workflow. Deconvolution algorithms disagree on precursor masses and charges, contributing to identification variability. Detection of PTMs is inconsistent among algorithms. In bovine milk, 18% of PrSMs produced by pTop and TopMG were singly phosphorylated, but this percentage fell to 1% for one algorithm. Applying multiple search engines produces more comprehensive assessments of experiments. Top-down algorithms would benefit from greater interoperability.


Asunto(s)
Proteoma , Espectrometría de Masas en Tándem , Humanos , Proteoma/genética , Proteómica , Programas Informáticos , Procesamiento Proteico-Postraduccional
2.
Microbiol Spectr ; : e0382622, 2023 Feb 27.
Artículo en Inglés | MEDLINE | ID: mdl-36847572

RESUMEN

The genus Yersinia includes a large variety of nonpathogenic and life-threatening pathogenic bacteria, which cause a broad spectrum of diseases in humans and animals, such as plague, enteritis, Far East scarlet-like fever (FESLF), and enteric redmouth disease. Like most clinically relevant microorganisms, Yersinia spp. are currently subjected to intense multi-omics investigations whose numbers have increased extensively in recent years, generating massive amounts of data useful for diagnostic and therapeutic developments. The lack of a simple and centralized way to exploit these data led us to design Yersiniomics, a web-based platform allowing straightforward analysis of Yersinia omics data. Yersiniomics contains a curated multi-omics database at its core, gathering 200 genomic, 317 transcriptomic, and 62 proteomic data sets for Yersinia species. It integrates genomic, transcriptomic, and proteomic browsers, a genome viewer, and a heatmap viewer to navigate within genomes and experimental conditions. For streamlined access to structural and functional properties, it directly links each gene to GenBank, the Kyoto Encyclopedia of Genes and Genomes (KEGG), UniProt, InterPro, IntAct, and the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and each experiment to Gene Expression Omnibus (GEO), the European Nucleotide Archive (ENA), or the Proteomics Identifications Database (PRIDE). Yersiniomics provides a powerful tool for microbiologists to assist with investigations ranging from specific gene studies to systems biology studies. IMPORTANCE The expanding genus Yersinia is composed of multiple nonpathogenic species and a few pathogenic species, including the deadly etiologic agent of plague, Yersinia pestis. In 2 decades, the number of genomic, transcriptomic, and proteomic studies on Yersinia grew massively, delivering a wealth of data. We developed Yersiniomics, an interactive web-based platform, to centralize and analyze omics data sets on Yersinia species. The platform allows user-friendly navigation between genomic data, expression data, and experimental conditions. Yersiniomics will be a valuable tool to microbiologists.

3.
J Mass Spectrom ; 58(3): e4909, 2023 Mar.
Artículo en Inglés | MEDLINE | ID: mdl-36822210

RESUMEN

In antibody-based drug research, a complete characterization of antibody proteoforms covering both the amino acid sequence and all posttranslational modifications remains a major concern. The usual mass spectrometry-based approach to achieve this goal is bottom-up proteomics, which relies on the digestion of antibodies but does not allow the diversity of proteoforms to be assessed. Middle-down and top-down approaches have recently emerged as attractive alternatives but are not yet mastered and thus used in routine by many analytical chemistry laboratories. The work described here aims at providing guidelines to achieve the best sequence coverage for the fragmentation of intact light and heavy chains generated from a simple reduction of intact antibodies using Orbitrap mass spectrometry. Three parameters were found crucial to this aim: the use of an electron-based activation technique, the multiplex selection of precursor ions of different charge states, and the combination of replicates.


Asunto(s)
Anticuerpos Monoclonales , Espectrometría de Masas en Tándem , Anticuerpos Monoclonales/análisis , Anticuerpos Monoclonales/química , Cromatografía Liquida/métodos , Espectrometría de Masas en Tándem/métodos , Secuencia de Aminoácidos , Procesamiento Proteico-Postraduccional
4.
Bioinformatics ; 38(5): 1261-1268, 2022 02 07.
Artículo en Inglés | MEDLINE | ID: mdl-34908131

RESUMEN

MOTIVATION: Protein-protein interactions (PPIs) are key elements in numerous biological pathways and the subject of a growing number of drug discovery projects including against infectious diseases. Designing drugs on PPI targets remains a difficult task and requires extensive efforts to qualify a given interaction as an eligible target. To this end, besides the evident need to determine the role of PPIs in disease-associated pathways and their experimental characterization as therapeutics targets, prediction of their capacity to be bound by other protein partners or modulated by future drugs is of primary importance. RESULTS: We present InDeep, a tool for predicting functional binding sites within proteins that could either host protein epitopes or future drugs. Leveraging deep learning on a curated dataset of PPIs, this tool can proceed to enhanced functional binding site predictions either on experimental structures or along molecular dynamics trajectories. The benchmark of InDeep demonstrates that our tool outperforms state-of-the-art ligandable binding sites predictors when assessing PPI targets but also conventional targets. This offers new opportunities to assist drug design projects on PPIs by identifying pertinent binding pockets at or in the vicinity of PPI interfaces. AVAILABILITY AND IMPLEMENTATION: The tool is available on GitLab at https://gitlab.pasteur.fr/InDeep/InDeep. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Redes Neurales de la Computación , Proteínas , Proteínas/química , Sitios de Unión , Unión Proteica , Diseño de Fármacos
5.
BMC Bioinformatics ; 22(1): 190, 2021 Apr 14.
Artículo en Inglés | MEDLINE | ID: mdl-33853521

RESUMEN

BACKGROUND: Harmonin Homogy Domains (HHD) are recently identified orphan domains of about 70 residues folded in a compact five alpha-helix bundle that proved to be versatile in terms of function, allowing for direct binding to a partner as well as regulating the affinity and specificity of adjacent domains for their own targets. Adding their small size and rather simple fold, HHDs appear as convenient modules to regulate protein-protein interactions in various biological contexts. Surprisingly, only nine HHDs have been detected in six proteins, mainly expressed in sensory neurons. RESULTS: Here, we built a profile Hidden Markov Model to screen the entire UniProtKB for new HHD-containing proteins. Every hit was manually annotated, using a clustering approach, confirming that only a few proteins contain HHDs. We report the phylogenetic coverage of each protein and build a phylogenetic tree to trace the evolution of HHDs. We suggest that a HHD ancestor is shared with Paired Amphipathic Helices (PAH) domains, a four-helix bundle partially sharing fold and functional properties. We characterized amino-acid sequences of the various HHDs using pairwise BLASTP scoring coupled with community clustering and manually assessed sequence features among each individual family. These sequence features were analyzed using reported structures as well as homology models to highlight structural motifs underlying HHDs fold. We show that functional divergence is carried out by subtle differences in sequences that automatized approaches failed to detect. CONCLUSIONS: We provide the first HHD databases, including sequences and conservation, phylogenic trees and a list of HHD variants found in the auditory system, which are available for the community. This case study highlights surprising phylogenetic properties found in orphan domains and will assist further studies of HHDs. We unveil the implication of HHDs in their various binding interfaces using conservation across families and a new protein-protein surface predictor. Finally, we discussed the functional consequences of three identified pathogenic HHD variants involved in Hoyeraal-Hreidarsson syndrome and of three newly reported pathogenic variants identified in patients suffering from Usher Syndrome.


Asunto(s)
Disqueratosis Congénita , Proteínas de la Membrana , Secuencia de Aminoácidos , Retardo del Crecimiento Fetal , Humanos , Proteínas de la Membrana/genética , Filogenia
6.
Bioinformatics ; 37(1): 89-96, 2021 Apr 09.
Artículo en Inglés | MEDLINE | ID: mdl-33416858

RESUMEN

MOTIVATION: One avenue to address the paucity of clinically testable targets is to reinvestigate the druggable genome by tackling complicated types of targets such as Protein-Protein Interactions (PPIs). Given the challenge to target those interfaces with small chemical compounds, it has become clear that learning from successful examples of PPI modulation is a powerful strategy. Freely accessible databases of PPI modulators that provide the community with tractable chemical and pharmacological data, as well as powerful tools to query them, are therefore essential to stimulate new drug discovery projects on PPI targets. RESULTS: Here, we present the new version iPPI-DB, our manually curated database of PPI modulators. In this completely redesigned version of the database, we introduce a new web interface relying on crowdsourcing for the maintenance of the database. This interface was created to enable community contributions, whereby external experts can suggest new database entries. Moreover, the data model, the graphical interface, and the tools to query the database have been completely modernized and improved. We added new PPI modulators, new PPI targets and extended our focus to stabilizers of PPIs as well. AVAILABILITY AND IMPLEMENTATION: The iPPI-DB server is available at https://ippidb.pasteur.fr The source code for this server is available at https://gitlab.pasteur.fr/ippidb/ippidb-web/ and is distributed under GPL licence (http://www.gnu.org/licences/gpl). Queries can be shared through persistent links according to the FAIR data standards. Data can be downloaded from the website as csv files. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

7.
J Phys Chem A ; 124(51): 10637-10648, 2020 Dec 24.
Artículo en Inglés | MEDLINE | ID: mdl-33170681

RESUMEN

We describe methods for physics-based protein design and some recent applications from our work. We present the physical interpretation of a MC simulation in sequence space and show that sequences and conformations form a well-defined statistical ensemble, explored with Monte Carlo and Boltzmann sampling. The folded state energy combines molecular mechanics for solutes with continuum electrostatics for solvent. We usually assume one or a few fixed protein backbone structures and discrete side chain rotamers. Methods based on molecular dynamics, which introduce additional backbone and side chain flexibility, are under development. The redesign of a PDZ domain and an aminoacyl-tRNA synthetase enzyme were successful. We describe a versatile, adaptive, Wang-Landau MC method that can be used to design for substrate affinity, catalytic rate, catalytic efficiency, or the specificity of these properties. The methods are transferable to all biomolecules, can be systematically improved, and give physical insights.


Asunto(s)
Proteínas/química , Algoritmos , Química Computacional , Interpretación Estadística de Datos , Simulación de Dinámica Molecular , Método de Montecarlo , Conformación Proteica , Pliegue de Proteína , Programas Informáticos , Termodinámica
8.
J Integr Bioinform ; 15(2)2018 Jun 21.
Artículo en Inglés | MEDLINE | ID: mdl-29927748

RESUMEN

Proteomic and transcriptomic technologies resulted in massive biological datasets, their interpretation requiring sophisticated computational strategies. Efficient and intuitive real-time analysis remains challenging. We use proteomic data on 1417 proteins of the green microalga Chlamydomonas reinhardtii to investigate physicochemical parameters governing selectivity of three cysteine-based redox post translational modifications (PTM): glutathionylation (SSG), nitrosylation (SNO) and disulphide bonds (SS) reduced by thioredoxins. We aim to understand underlying molecular mechanisms and structural determinants through integration of redox proteome data from gene- to structural level. Our interactive visual analytics approach on an 8.3 m2 display wall of 25 MPixel resolution features stereoscopic three dimensions (3D) representation performed by UnityMol WebGL. Virtual reality headsets complement the range of usage configurations for fully immersive tasks. Our experiments confirm that fast access to a rich cross-linked database is necessary for immersive analysis of structural data. We emphasize the possibility to display complex data structures and relationships in 3D, intrinsic to molecular structure visualization, but less common for omics-network analysis. Our setup is powered by MinOmics, an integrated analysis pipeline and visualization framework dedicated to multi-omics analysis. MinOmics integrates data from various sources into a materialized physical repository. We evaluate its performance, a design criterion for the framework.


Asunto(s)
Proteínas Algáceas/metabolismo , Chlamydomonas reinhardtii/metabolismo , Gráficos por Computador , Procesamiento Proteico-Postraduccional , Programas Informáticos , Proteínas Algáceas/química , Chlamydomonas reinhardtii/química , Imagenología Tridimensional , Modelos Estructurales , Oxidación-Reducción , Conformación Proteica , Mapas de Interacción de Proteínas , Proteoma/análisis , Proteómica/métodos , Realidad Virtual
9.
J Mol Graph Model ; 71: 192-199, 2017 01.
Artículo en Inglés | MEDLINE | ID: mdl-27939931

RESUMEN

The stereospecificity of aminoacyl-tRNA synthetases helps exclude d-amino acids from protein synthesis and could perhaps be engineered to allow controlled d-amino acylation of tRNA. We use molecular dynamics simulations to probe the stereospecificity of the class I tyrosyl- and glutaminyl-tRNA synthetases (TyrRS, GlnRS), including wildtype enzymes and three point mutants suggested by three different protein design methods. l/d binding free energy differences are obtained by alchemically and reversibly transforming the ligand from L to D in simulations of the protein-ligand complex. The D81Q mutation in Escherichia coli TyrRS is homologous to the D81R mutant shown earlier to have inverted stereospecificity. D81Q is predicted to lead to a rotated ligand backbone and an increased, not a decreased l-Tyr preference. The E36Q mutation in Methanococcus jannaschii TyrRS has a predicted l/d binding free energy difference ΔΔG of just 0.5±0.9kcal/mol, compared to 3.1±0.8kcal/mol for the wildtype enzyme (favoring l-Tyr). The ligand ammonium position is preserved in the d-Tyr complex, while the carboxylate is shifted. Wildtype GlnRS has a similar preference for l-glutaminyl adenylate; the R260Q mutant has an increased preference, even though Arg260 makes a large contribution to the wildtype ΔΔG value.


Asunto(s)
Aminoacil-ARNt Sintetasas/química , Simulación de Dinámica Molecular , Biosíntesis de Proteínas/genética , Tirosina-ARNt Ligasa/química , Adenosina Monofosfato/química , Aminoácidos/química , Aminoácidos/genética , Aminoacil-ARNt Sintetasas/genética , Escherichia coli/enzimología , Methanocaldococcus/enzimología , Mutación Puntual , Ingeniería de Proteínas , Tirosina-ARNt Ligasa/genética
10.
J Chem Theory Comput ; 12(12): 6035-6048, 2016 Dec 13.
Artículo en Inglés | MEDLINE | ID: mdl-27775883

RESUMEN

Multistate protein design explores side chain mutations, with the backbone allowed to sample a small, predetermined library of conformations. To achieve Boltzmann sampling of sequences and conformations, we use a hybrid Monte Carlo (MC) scheme: a trial hop between backbone models is followed by a short MC segment where side chain rotamers adjust to the new backbone, before applying a Metropolis-like acceptance test. The theoretical form and a practical approximation for the acceptance test are derived. We then compute backbone conformational free energies for two SH2 and SH3 proteins using different routes and protocols, and verify that for simple test problems, the free energy behaves like a state function, a hallmark of Boltzmann sampling.


Asunto(s)
Proteínas/química , Dominio Catalítico , Enzimas/química , Enzimas/metabolismo , Simulación de Dinámica Molecular , Método de Montecarlo , Estructura Terciaria de Proteína , Proteínas/metabolismo , Termodinámica , Dominios Homologos src
11.
Methods Mol Biol ; 1414: 77-97, 2016.
Artículo en Inglés | MEDLINE | ID: mdl-27094287

RESUMEN

This chapter describes the organization and use of Proteus, a multitool computational suite for the optimization of protein and ligand conformations and sequences, and the calculation of pK α shifts and relative binding affinities. The software offers the use of several molecular mechanics force fields and solvent models, including two generalized Born variants, and a large range of scoring functions, which can combine protein stability, ligand affinity, and ligand specificity terms, for positive and negative design. We present in detail the steps for structure preparation, system setup, construction of the interaction energy matrix, protein sequence and structure optimizations, pK α calculations, and ligand titration calculations. We discuss illustrative examples, including the chemical/structural optimization of a complex between the MHC class II protein HLA-DQ8 and the vinculin epitope, and the chemical optimization of the compstatin analog Ac-Val4Trp/His9Ala, which regulates the function of protein C3 of the complement system.


Asunto(s)
Proteínas/química , Sitios de Unión , Ligandos , Termodinámica
12.
Proteins ; 84(2): 240-53, 2016 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-26676967

RESUMEN

D-Amino acids are largely excluded from protein synthesis, yet they are of great interest in biotechnology. Unnatural amino acids have been introduced into proteins using engineered aminoacyl-tRNA synthetases (aaRSs), and this strategy might be applicable to D-amino acids. Several aaRSs can aminoacylate their tRNA with a D-amino acid; of these, tyrosyl-tRNA synthetase (TyrRS) has the weakest stereospecificity. We use computational protein design to suggest active site mutations in Escherichia coli TyrRS that could increase its D-Tyr binding further, relative to L-Tyr. The mutations selected all modify one or more sidechain charges in the Tyr binding pocket. We test their effect by probing the aminoacyl-adenylation reaction through pyrophosphate exchange experiments. We also perform extensive alchemical free energy simulations to obtain L-Tyr/D-Tyr binding free energy differences. Agreement with experiment is good, validating the structural models and detailed thermodynamic predictions the simulations provide. The TyrRS stereospecificity proves hard to engineer through charge-altering mutations in the first and second coordination shells of the Tyr ammonium group. Of six mutants tested, two are active towards D-Tyr; one of these has an inverted stereospecificity, with a large preference for D-Tyr. However, its activity is low. Evidently, the TyrRS stereospecificity is robust towards charge rearrangements near the ligand. Future design may have to consider more distant and/or electrically neutral target mutations, and possibly design for binding of the transition state, whose structure however can only be modeled.


Asunto(s)
Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/metabolismo , Tirosina-ARNt Ligasa/química , Tirosina-ARNt Ligasa/metabolismo , Proteínas de Escherichia coli/genética , Simulación de Dinámica Molecular , Ingeniería de Proteínas , Estereoisomerismo , Termodinámica , Tirosina-ARNt Ligasa/genética
13.
J Comput Chem ; 37(4): 404-15, 2016 Feb 05.
Artículo en Inglés | MEDLINE | ID: mdl-26503829

RESUMEN

A computational protein design method is extended to allow Monte Carlo simulations where two ligands are titrated into a protein binding pocket, yielding binding free energy differences. These provide a stringent test of the physical model, including the energy surface and sidechain rotamer definition. As a test, we consider tyrosyl-tRNA synthetase (TyrRS), which has been extensively redesigned experimentally. We consider its specificity for its substrate l-tyrosine (l-Tyr), compared to the analogs d-Tyr, p-acetyl-, and p-azido-phenylalanine (ac-Phe, az-Phe). We simulate l- and d-Tyr binding to TyrRS and six mutants, and compare the structures and binding free energies to a more rigorous "MD/GBSA" procedure: molecular dynamics with explicit solvent for structures and a Generalized Born + Surface Area model for binding free energies. Next, we consider l-Tyr, ac- and az-Phe binding to six other TyrRS variants. The titration results are sensitive to the precise rotamer definition, which involves a short energy minimization for each sidechain pair to help relax bad contacts induced by the discrete rotamer set. However, when designed mutant structures are rescored with a standard GBSA energy model, results agree well with the more rigorous MD/GBSA. As a third test, we redesign three amino acid positions in the substrate coordination sphere, with either l-Tyr or d-Tyr as the ligand. For two, we obtain good agreement with experiment, recovering the wildtype residue when l-Tyr is the ligand and a d-Tyr specific mutant when d-Tyr is the ligand. For the third, we recover His with either ligand, instead of wildtype Gln.


Asunto(s)
Termodinámica , Tirosina-ARNt Ligasa/química , Tirosina-ARNt Ligasa/metabolismo , Tirosina/química , Tirosina/metabolismo , Sitios de Unión/efectos de los fármacos , Ligandos , Simulación de Dinámica Molecular , Método de Montecarlo , Proteínas Mutantes/química , Proteínas Mutantes/genética , Proteínas Mutantes/metabolismo , Unión Proteica , Tirosina-ARNt Ligasa/genética
14.
Future Med Chem ; 7(17): 2317-31, 2015.
Artículo en Inglés | MEDLINE | ID: mdl-26599419

RESUMEN

AIM: Molecular dynamics simulations and normal mode analysis are well-established approaches to generate receptor conformational ensembles (RCEs) for ligand docking and virtual screening. Here, we report new fast molecular dynamics-based and normal mode analysis-based protocols combined with conformational pocket classifications to efficiently generate RCEs. MATERIALS & METHODS: We assessed our protocols on two well-characterized protein targets showing local active site flexibility, dihydrofolate reductase and large collective movements, CDK2. The performance of the RCEs was validated by distinguishing known ligands of dihydrofolate reductase and CDK2 among a dataset of diverse chemical decoys. RESULTS & DISCUSSION: Our results show that different simulation protocols can be efficient for generation of RCEs depending on different kind of protein flexibility.


Asunto(s)
Simulación de Dinámica Molecular , Sitios de Unión , Quinasa 2 Dependiente de la Ciclina/química , Quinasa 2 Dependiente de la Ciclina/metabolismo , Ensayos Analíticos de Alto Rendimiento , Humanos , Ligandos , Estructura Terciaria de Proteína , Tetrahidrofolato Deshidrogenasa/química , Tetrahidrofolato Deshidrogenasa/metabolismo
15.
J Comput Chem ; 34(28): 2472-84, 2013 Oct 30.
Artículo en Inglés | MEDLINE | ID: mdl-24037756

RESUMEN

We describe an automated procedure for protein design, implemented in a flexible software package, called Proteus. System setup and calculation of an energy matrix are done with the XPLOR modeling program and its sophisticated command language, supporting several force fields and solvent models. A second program provides algorithms to search sequence space. It allows a decomposition of the system into groups, which can be combined in different ways in the energy function, for both positive and negative design. The whole procedure can be controlled by editing 2-4 scripts. Two applications consider the tyrosyl-tRNA synthetase enzyme and its successful redesign to bind both O-methyl-tyrosine and D-tyrosine. For the latter, we present Monte Carlo simulations where the D-tyrosine concentration is gradually increased, displacing L-tyrosine from the binding pocket and yielding the binding free energy difference, in good agreement with experiment. Complete redesign of the Crk SH3 domain is presented. The top 10000 sequences are all assigned to the correct fold by the SUPERFAMILY library of Hidden Markov Models. Finally, we report the acid/base behavior of the SNase protein. Sidechain protonation is treated as a form of mutation; it is then straightforward to perform constant-pH Monte Carlo simulations, which yield good agreement with experiment. Overall, the software can be used for a wide range of application, producing not only native-like sequences but also thermodynamic properties with errors that appear comparable to other current software packages.


Asunto(s)
Biología Computacional , Proteínas/química , Programas Informáticos , Algoritmos , Concentración de Iones de Hidrógeno , Modelos Moleculares , Simulación de Dinámica Molecular , Método de Montecarlo , Desplegamiento Proteico , Proteínas Proto-Oncogénicas c-crk/química , Tirosina/análogos & derivados , Tirosina/química , Tirosina/metabolismo , Tirosina-ARNt Ligasa/química , Tirosina-ARNt Ligasa/metabolismo , Dominios Homologos src
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...